Validity threats: overcoming interference with proposed interpretations of assessment data.

Authors

  • Steven M Downing
  • Thomas M Haladyna
Abstract

CONTEXT Factors that interfere with the ability to interpret assessment scores or ratings in the proposed manner threaten validity. To be interpreted in a meaningful manner, all assessments in medical education require sound, scientific evidence of validity. PURPOSE The purpose of this essay is to discuss 2 major threats to validity: construct under-representation (CU) and construct-irrelevant variance (CIV). Examples of each type of threat for written, performance and clinical performance examinations are provided. DISCUSSION The CU threat to validity refers to undersampling the content domain. Using too few items, cases or clinical performance observations to adequately generalise to the domain represents CU. Variables that systematically (rather than randomly) interfere with the ability to meaningfully interpret scores or ratings represent CIV. Issues such as flawed test items written at inappropriate reading levels or statistically biased questions represent CIV in written tests. For performance examinations, such as standardised patient examinations, flawed cases or cases that are too difficult for student ability contribute CIV to the assessment. For clinical performance data, systematic rater error, such as halo or central tendency error, represents CIV. The term face validity is rejected as representative of any type of legitimate validity evidence, although the fact that the appearance of the assessment may be an important characteristic other than validity is acknowledged. CONCLUSIONS There are multiple threats to validity in all types of assessment in medical education. Methods to eliminate or control validity threats are suggested.

Similar resources

Establishing an Argument-Based Validity Approach for a Low-Stake Test of Collocational Behavior

Most of the validation studies conducted across varying test application contexts are usually framed within the traditional conceptualization of validity and therefore lack a comprehensive framework to focus on test score interpretations and test score use. This study aimed at developing and validating a collocational behavior test (CBT), drawing on Kane's argument-based approach to validity. F...

Validation of educational assessments: a primer for simulation and beyond

Background Simulation plays a vital role in health professions assessment. This review provides a primer on assessment validation for educators and education researchers. We focus on simulation-based assessment of health professionals, but the principles apply broadly to other assessment approaches and topics. Key principles Validation refers to the process of collecting validity evidence to ...

Validity Threats in Empirical Software Engineering Research - An Initial Survey

In judging the quality of a research study it is very important to consider threats to the validity of the study and the results. This is particularly important for empirical research where there is often a multitude of possible threats. With a growing focus on empirical research methods in software engineering it is important that there is a consensus in the community on this importance, that ...

Comparison of satisfaction with post-operative pain management and level of functional interference in addicted and non-addicted patients

Introduction: Postoperative pain relief in addicted patients compared with non-addicted patients is often more challenging and they usually suffer from inadequate postoperative pain relief. Objective: The purpose of this study was to compare satisfaction with postoperative pain management in addicted and non-addicted patients and level of pain interference with their functions. Methods: In th...

HSE Cultural Assessment by SWOT-AHP Case study: Shirvan combined cycle power plant.

This research was aimed at evaluating and promoting the HSE culture in the Shirvan combined cycle power plant. 125 people were selected as the statistical population and the hypotheses were tested. Based on the results of the IFE matrix (2.663) and the EFE matrix (2.875), strengths outweigh weaknesses and opportunities outweigh threats. The aggressive strategy was declared ...

Journal:
  • Medical education

Volume 38, Issue 3

Pages: -

Publication date: 2004